[Refactor] Remove redundant StageDeployConfig fields, delegate to vLLM defaults by gcanlin · Pull Request #3128 · vllm-project/vllm-omni

gcanlin · 2026-04-25T02:58:12Z


Purpose

Remove fields from StageDeployConfig that duplicate vLLM EngineArgs defaults:

gpu_memory_utilization
tensor_parallel_size
enforce_eager
max_num_batched_tokens
max_model_len
async_scheduling

These fields now flow through engine_extras and inherit vLLM's hardware-specific defaults. Existing YAML configs remain compatible because any field that StageDeployConfig does not declare is routed into engine_extras automatically.

Retained fields with vllm-omni specific behavior:

max_num_seqs (default 64 vs vLLM's 256)
devices (stage device assignment)
default_sampling_params / subtalker_sampling_params (per-stage sampling)
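
To illustrate the pass-through described above, here is a minimal sketch. It assumes a simplified, hypothetical stand-in class (`StageDeployConfigSketch`) and a hypothetical `parse_stage` helper; neither is the real vllm-omni code, and the type chosen for `devices` is a guess. Only the retained fields are declared, and everything else lands in `engine_extras`.

```python
from dataclasses import dataclass, field, fields
from typing import Any, Optional


@dataclass
class StageDeployConfigSketch:
    """Hypothetical stand-in for StageDeployConfig; not the real class."""

    stage_id: int
    max_num_seqs: int = 64  # retained: vllm-omni default differs from vLLM's
    devices: Optional[str] = None  # retained: stage device assignment (type assumed)
    # Catch-all for every key this class does not declare itself.
    engine_extras: dict[str, Any] = field(default_factory=dict)


def parse_stage(raw: dict[str, Any]) -> StageDeployConfigSketch:
    """Split a YAML-like mapping into declared fields and engine_extras."""
    declared = {f.name for f in fields(StageDeployConfigSketch)} - {"engine_extras"}
    known = {k: v for k, v in raw.items() if k in declared}
    extras = {k: v for k, v in raw.items() if k not in declared}
    return StageDeployConfigSketch(**known, engine_extras=extras)


# An older YAML entry that still sets removed fields keeps working:
cfg = parse_stage({"stage_id": 0, "gpu_memory_utilization": 0.8, "max_model_len": 4096})
print(cfg.max_num_seqs)   # declared field keeps its vllm-omni default
print(cfg.engine_extras)  # removed fields are forwarded untouched
```

Fields that are absent from both the YAML and the class are simply not passed on, which is what lets the engine's own defaults apply.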

Test Plan

Test Result



chatgpt-codex-connector · 2026-04-25T02:58:16Z

Codex usage limits have been reached for code reviews. Please check with the admins of this repo to increase the limits by adding credits.
Credits must be used to enable repository wide code reviews.

gcanlin · 2026-04-25T03:04:34Z

I found that this field overrides vLLM's default value, which produces different behavior from before the stage config refactoring. So I think we don't need to set a default on the vllm-omni side. vLLM already has a selector method for this field:

https://github.com/vllm-project/vllm/blob/e54894fc85a9861fb38a49701b5844462c3d77e4/vllm/engine/arg_utils.py#L2179-L2260

gcanlin · 2026-04-25T03:04:52Z

cc @amy-why-3459

amy-why-3459 · 2026-04-25T03:09:46Z

cc @amy-why-3459

I completely agree. The default parameters should be consistent with vLLM; otherwise, the inconsistent defaults will affect performance.

amy-why-3459 · 2026-04-25T03:46:54Z

If it involves modifying deployment parameters, I suggest testing with nightly-test.


gcanlin · 2026-04-25T07:32:11Z

cc @yenuo26

yenuo26 · 2026-04-25T07:41:24Z



-DEPLOY_CONFIGS_DIR = Path(__file__).parent.parent / "deploy"
+DEPLOY_CONFIGS_DIR = Path(__file__).resolve().parents[4] / "vllm_omni" / "deploy"


Maybe you can use get_deploy_config_path from tests/helpers/stage_config.py instead.
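
For context on why a shared helper is preferable to a hard-coded `parents[4]`, here is a minimal sketch of a lookup that does not depend on how deeply the test file is nested. `find_deploy_dir` is a hypothetical name for illustration; it is not the `get_deploy_config_path` helper mentioned above, whose signature is not shown in this thread.

```python
from pathlib import Path
from typing import Optional


def find_deploy_dir(start: Path, package: str = "vllm_omni") -> Optional[Path]:
    """Walk upward from `start` until <ancestor>/<package>/deploy exists.

    Hypothetical helper: returns None when no ancestor contains the directory.
    """
    start = start.resolve()
    for ancestor in (start, *start.parents):
        candidate = ancestor / package / "deploy"
        if candidate.is_dir():
            return candidate
    return None
```

A test module could then call `find_deploy_dir(Path(__file__))` and keep working if the file is moved to a different depth in the tree.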


yenuo26 · 2026-04-25T07:50:41Z

-            "stage_args": {
-                0: {"engine_args.enable_prefix_caching": True},
+            "stages": {
+                0: {"enable_prefix_caching": True},


Does this not support passing arguments from the command line?

Sounds good. But would be better to unify it in a follow-up PR.


xiaohajiayou · 2026-04-27T17:26:28Z

    stage_id: int
-    max_num_seqs: int = 64
-    gpu_memory_utilization: float = 0.9
-    tensor_parallel_size: int = 1


For the requirement where fields defined in deploy YAML are selectively overridden, these fields still need to be retained as a whitelist to preserve the override semantics.

The core issue here is handling default values. A more consistent approach would be to treat all values as optional (i.e., default to None), and introduce a filtering step in deploy_create_from_registry(), such as:

explicit_overrides = {k: v for k, v in override_fields.items() if v is not None}

This ensures that only values explicitly provided in the deploy YAML are included in the resulting StageConfig, while missing fields are left unset and handled by downstream default resolution.

With this design, as discussed in #3162:

For LLM stages, missing fields will fall back to the default values defined in vLLM's EngineArgs.

For diffusion stages, missing fields will be handled by _create_default_diffusion_stage_cfg, which provides safe defaults.

This unifies the override behavior while delegating default resolution to the appropriate downstream layer.
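
A minimal sketch of the filtering step proposed in this comment, under stated assumptions: `resolve` and the default values below are hypothetical placeholders, not vLLM's or vllm-omni's real code or numbers. It shows that a field left as None falls through to the downstream default, while an explicit falsy value such as False is still treated as an override.

```python
from typing import Any, Optional


def explicit_overrides(override_fields: dict[str, Optional[Any]]) -> dict[str, Any]:
    """Keep only values that were actually set (None means 'not provided')."""
    return {k: v for k, v in override_fields.items() if v is not None}


def resolve(
    override_fields: dict[str, Optional[Any]],
    downstream_defaults: dict[str, Any],
) -> dict[str, Any]:
    """Layer explicit overrides on top of the downstream defaults."""
    return {**downstream_defaults, **explicit_overrides(override_fields)}


# Placeholder numbers for illustration only; not vLLM's real defaults.
downstream_defaults = {"gpu_memory_utilization": 0.5, "tensor_parallel_size": 1}
yaml_fields = {
    "gpu_memory_utilization": None,  # not set in YAML: downstream default wins
    "tensor_parallel_size": 2,       # set in YAML: override wins
    "enforce_eager": False,          # explicit False is kept, since it is not None
}

resolved = resolve(yaml_fields, downstream_defaults)
print(resolved)
```

Testing with `is not None` rather than truthiness matters here: a plain `if v` filter would silently drop `False` and `0`, which are legitimate explicit settings.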

hsliuustc0106 · 2026-04-28T00:29:59Z

LGTM. The CI failure may not be related to this PR. Please resolve the conflicts.


Gaohan123 · 2026-05-05T09:15:22Z

Please fix the conflicts and pre-commit problems.

[Refactor] Remove redundant StageDeployConfig fields, delegate to vLL…

e69079d

…M defaults Signed-off-by: gcanlin <canlinguosdu@gmail.com>

gcanlin requested a review from hsliuustc0106 as a code owner April 25, 2026 02:58

gcanlin requested review from hsliuustc0106, linyueqian, lishunyang12 and princepride and removed request for hsliuustc0106 April 25, 2026 02:58

gcanlin changed the title ~~[Refactor] Remove redundant StageDeployConfig fields, delegate to vLL…~~ [Refactor] Remove redundant StageDeployConfig fields, delegate to vLLM defaults Apr 25, 2026

gcanlin added the ready label (label to trigger buildkite CI) Apr 25, 2026

gcanlin commented Apr 25, 2026


lishunyang12 approved these changes Apr 25, 2026


gcanlin added the omni-test (label to trigger buildkite omni model test in nightly CI) and nightly-test (label to trigger buildkite nightly test CI) labels and removed the omni-test label Apr 25, 2026

fix

8090a08

Signed-off-by: gcanlin <canlinguosdu@gmail.com>

gcanlin added the omni-test label and removed the nightly-test and ready labels Apr 25, 2026

gcanlin added 2 commits April 25, 2026 07:27

fix

7750057

Signed-off-by: gcanlin <canlinguosdu@gmail.com>

Merge branch 'main' into pipeline-clean

3a765d6

yenuo26 reviewed Apr 25, 2026


fix

177d1d3

Signed-off-by: gcanlin <canlinguosdu@gmail.com>

yenuo26 reviewed Apr 25, 2026


hsliuustc0106 added the ready label Apr 25, 2026

unify qwen3-omni config

f7074a9

Signed-off-by: gcanlin <canlinguosdu@gmail.com>

hsliuustc0106 mentioned this pull request Apr 26, 2026

[Entrypoint][Refactor] Make field type hint more concrete #3139

Merged

5 tasks

xiaohajiayou reviewed Apr 27, 2026


Merge branch 'main' into pipeline-clean

4d034b3

gcanlin force-pushed the pipeline-clean branch from 507b47c to 4d034b3 Compare April 28, 2026 01:44

gcanlin enabled auto-merge (squash) April 28, 2026 12:26

gcanlin disabled auto-merge April 28, 2026 12:26

Merge branch 'main' into pipeline-clean

491b09a

hsliuustc0106 removed the ready and omni-test labels Apr 29, 2026

Merge branch 'main' into pipeline-clean

a6d4eb4

hsliuustc0106 added the high priority label (high priority issue, needs to be done asap) Apr 30, 2026

gcanlin added 2 commits May 2, 2026 06:58

Merge branch 'main' into pipeline-clean

f3441dc

revert

3ccf735

Signed-off-by: gcanlin <canlinguosdu@gmail.com>

gcanlin added the omni-test label May 2, 2026

gcanlin and others added 5 commits May 3, 2026 11:32

Merge branch 'main' into pipeline-clean

b1b072f

fix

df78402

Signed-off-by: gcanlin <canlinguosdu@gmail.com>

Merge branch 'main' into pipeline-clean

b5b7b43

fix

1c32d30

Signed-off-by: gcanlin <canlinguosdu@gmail.com>

fix

bc0b11e

Signed-off-by: gcanlin <canlinguosdu@gmail.com>

hsliuustc0106 added the ready label May 3, 2026

gcanlin added 2 commits May 4, 2026 01:36

Merge branch 'main' into pipeline-clean

8c71834

fix acc

e866167

Signed-off-by: gcanlin <canlinguosdu@gmail.com>

xiaohajiayou mentioned this pull request May 4, 2026

[BugFix] Fix Whitelist optimization CI failure #3290

Merged

5 tasks

Gaohan123 removed the ready, high priority, and omni-test labels May 6, 2026

Gaohan123 modified the milestones: v0.20.0, v0.22.0 May 9, 2026

[Refactor] Remove redundant StageDeployConfig fields, delegate to vLLM defaults #3128
gcanlin wants to merge 18 commits into vllm-project:main from gcanlin:pipeline-clean

7 participants